Table 12: IRES Sequences 
SEQ ID NO:_ A 519 base pair IRES obtainable from 
encephelomycarditis virus (EMCV). 

1 GACGTCGACTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAA 

Sail 

51 TGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGG 

101 GTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAG 

151 GAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGAC 

2 01 CCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCC 

2 51 AAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGC 

3 01 CACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAG 
351 CGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGG 

4 01 GATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
451 TTAAAAAACGTCTAGGCCCGCCGAACCACGGGGACGTGGTTTTCCTTTGA 

Sail 

5 01 AAAACACGATGTCGACGTC 



SEQ ID NO: An IRES obtainable from vascular endothelial 

growth factor (VEGF). 

1 ACGTAGTCGACAGCGCAGAGGCTTGGGGCAGCCGAGCGGCAGCCAGGCCC 
Sail 

5 1 CGGCCCGGGCCTCGGTTCCAGAAGGGAGAGGAGCCCGCCAAGGCGCGCAA 

101 GAGAGCGGGCTGCCTCGCAGTCCGAGCCGGAGAGGGAGCGCGAGCCGCGC 

151 CGGCCCCGGACGGCCTCCGAAACCATGGTCGACACGTA 

Sail 



SEQ ID NO:_ A 5'UTR region of HCV. 

1 

GCCAGCCCCCTGATGGGGGCGACACTCCGCCATGAATCACTCCCCTGTGAGGAACTACTG 

61 

TCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGAC 
121 CCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAG 
181 GACGACCGGGTCCTTTCTTGGATTAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCC 
241 GCAAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGG 
3 01 GTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCACC (341) 
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SEQ ID NO:_ A 5'UTR region of BiP SEQ ID NO:4 



CCCGGGGTCACTCCTGCTGGACCTACTCCGACCCCCTAGGCCGGGAGTGAAGGCGGGACT 



61 



D TGTGCGGTTACCAGCGGAAATGCCTCGGGGTCAGAAGTCGCAGGAGAGATAGACAGCTGC 

121 TGAACCAATGGGACCAGCGGATGGGGCGGATGTTATCTACC ATTGGTGAACGTTAGAAAC 

181 GAATAGCAGCCAATGAATCAGCTGGGGGGGCGGAGCAGTGACGTTTATTGCGGAGGGGGC 

241 CGCTTCGAATCGGCGGCGGCCAGCTTGGTGGCCTGGGCCAATGAACGGCCTCCAACGAGC 

3 01 AGGGCCTTCACCAATCGGCGGCCTCCACGACGGGGCTGGGGGAGGGTATATAAGCCGAGT 
10 3 61 AGGCGACGGTGAGGTCGACGCCGGCCAAGACAGCACAGACAGATTGACCTATTGGGGTGT 

421 TTCGCGAGTGTGAGAGGGAAGCGCCGCGGCCTGTATTTCTAGACGTGCCCTTCGCCTGGT 

481 TCGTGGCGCCTTGTGACCCCGGGCCCCTGCCGCCTGCAAGTCGAAATTGCGCTGTGCTCC 

541 TGTGCTACGGCCTGTGGCTGGACTGCCTGCTGCTGCCCAACTGGCTGGCAAGATG (595) 

15 SEQ ID NO:_ A 5'UTR of PDGF SEQ ID N0:5 

□ 1 

GTTTGCACCTCTCCCTGCCCGGGTGCTCGAGCTGCCGTTGCAAAGCCAACTTTGGAAAAA 

61 

gttttttgggggagacttgggccttgaggtgcccagctccgcgctttccgattttggggg 

j20 121 ctttccagaaaatgttgcaaaaaagctaagccggcgggcagaggaaaacgcctgtagccg 

.| 181 gcgagtgaagacgaaccatcgactgccgtgttccttttcctcttggaggttggagtcccc 

_ 241 tgggcgcccccacacccctagacgcctcggctggttcgcgacgcagccccccggccgtgg 

' h 3 01 atgctgcactcgggctcgggatccgcccaggtagccggcctcggacccaggtcctgcgcc 

y 3 61 caggtcctcccctgccccccagcgacggagccggggccgggggcggcggcgccgggggca 

;;25 421 tgcgggtgagccgcggctgcagaggcctgagcgcctgatcgccgcggacctgagccgagc 

4 81 ccacccccctccccagccccccaccctggccgcgggggcggcgcgctcgatctacgcgtc 

541 CGGGGCCCCGCGGGGCCGGGCCCGGAGTCGGCATG (575) 
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Table 13: Literature References For IRES 



IRES Host 


Example 


Reference 


Picomaviras 


HAV 


Glass etaL, 1993. Virol 193:842-852 




HiVH^ V 


Tfina Wimmpr 1 QQO (^ptip Dpv 4*1 579 
Joii^ oc vviiiiiiici, iyy\j, vjciic jl/C/ v t-.i*juv/"i>^ /^^ 




Poliovirus 


Borman et al., 1994. EMBO J 13:3149-3157 


HCV and 
pestiviras 


HCV 


Tsukiyama-Kohara et al., 1992. J Virol 66:1476- 
1483 




BVDV 


Frolovletal., 1998. RNA. 4:1418-1435 


Leishmania 
virus 


LRV-1 


Maga et al., 1995. Mol Cell Biol 15:4884-4889 


Retroviruses 


MoMLV 
VL30 (Harvey 
mxirine sarcoma 
virus) 


Torrent et al., 1996. Hum Gene Ther 7:603-612 




REV 


Lopez-Lastra et al., 1997. Hum Gene Ther 
s-is^'n ISA*; 


Eukaryotic 
mKJNA 


BiP 

antennapedia 
mRNA 


Macejak & Samow, 1991. Nature 353:90-94 
Oh et al., 1992. Gene & Dev 6:1643-1653 




FGF-2 


Vagner et al., 1995. Mol Cell Biol 15:35-44 






rJemstem et ai., lyy/. J rJioi unem z/z.yjjo- 
9362 




IGFII 


Teerink et al., 1995. Biochim Biophys Acta 
1264:403-408 




eIF4G 


Gan & Rhoads, 1996. J Biol Chem 271:623-626 




VEGF 


Stein et al., 1998. Mol Cell Biol 18:3112-3119; 
Huezetal., 1998. Mol Cell Biol 18:6178-6190 



Table 14: TRE Sequences 

Nucleotide sequence of a human uroplakin II 5' flanking region. Position +1 
(the translational start site) is denoted with an asterisk. SEQ ID NO: (niraiber 1 of 
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SEQ E) N0:_ corresponds to position -2239 with respect to the translational start 
site). 

TCGATAGGTA CCCACTATAG GGCACGCGTG GTCGACGGCC CGGGCTGGTC 

1 50 

TGGCAACTTC AAGTGTGGGC CTTTCAGACC GGCATCATCA GTGTTACGGG 

51 100 

GAAGTCACTA GGAATGCAGA ATTGATTGAG CACGGTGGCT CACACCTGTA 

101 150 

ATCCCAACAC TCTGGGAGGC CAAGGCAGGT GGATCACTTG TGGTCAGGAG 

151 200 

TTTGAGACCA GCCTGGCCAA CATGGTGAAA CCTCATCTCT ACTAAAAATA 

201 ' 250 

CAAAAATTAG CTGGGAATGG TGGCACATGC CTATAATCCC AGTTACTCAG 

251 300 

GAGGCTGAGG CAGGAGAATC ATTTGAACCT GGGAGGCAGA GGTTGCAGTG 

301 350 

AGCCGAGATC ACGCCACTGC ACTCCAGCCT GGGTGACACA GCGAGACTCT 

351 400 

GTCTCAAAAA AAAAAAAATG CAGAATTTCA GGCTTCACCC CAGACCCACT 

401 450 

GCATGACTGC ATGAGAAGCT GCATCTTAAC AAGATCCCTG GTAATTCATA 

451 500 

CGCATATTAA ATTTGGAGAT GCACTGGCGT AAGACCCTCC TACTCTCTGC 

501 550 

TTAGGCCCAT GAGTTCTTCC TTTACTGTCA TTCTCCACTC ACCCCAAACT 

551 600 

TTGAGCCTAC CCTTCCCACC TTGGCGGTAA GGACACAACC TCCCTCACAT 

601 - 650 

TCCTACCAGG ACCCTAAGCT TCCCTGGGAC TGAGGAAGAT AGAATAGTTC 

651 700 

GTGGAGCAAA CAGATATACA GCAACAGTCT CTGTACAGCT CTCAGGCTTC 

701 750 

TGGAAGTTCT ACAGCCTCTC CCGACAAAGT ATTCCACTTT CCACAAGTAA 

751 800 

CTCTATGTGT CTGAGTCTCA GTTTCCACTT TTCTCTCTCT . CTCTCTCTCT 

801 850 
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CAACTTTCTG 
851 

5 GCACAATCTC 
901 

CCTGTCTCAG 
951 

10 

TAGTTTTTGT 
1001 

GATCTCGAAC 
15 1051 

TGCTGGGATT 
1101 

20 TTTAATAGAG 
1151 

y GGCCTCGAGT 

0 1201 
^25 

p GCATGAGCCA 
d 1251 

4 CCAGCTACCA 
30 1301 

Jl CCTACCCAGG 

"35 AGAACTCCTC 

1 1401 

AGGGAAGTTC 
1451 

40 

ATTCTCAGTC 
1501 

CAACCCCTGC 
45 1551 

TCTTCCATGG 
1601 

50 TGTGCTCCCT 
1651 

AAGCCTCAGT 
1701 



AGACAGAGTT TCACTTAGTC 

GGCTCACTGC AACCTCCACC 

CCTCCCGAGT AGCTGGGATT 

ATTTTTGGTA GAGATGGTGT 

TCCTGACCTC AGGTGATCCG 

ACAGGCATGA GCCACCACGC 

ATCAAACTCT CTGTGTTGCC 

GATCCTCCCA CCTTGGCCTC 

CTGTGCCTGG CCTCAGTTCT 

CCCAGGGTGG CTGTAGGGCT 

GCCCGGAAGA AGCCCCGACT 

CGCTTCTTTC TGATGTAGCC 

TGTCTCTTTT TCATGTTATC 

CAACAAATGG TTGCTGCCCA 

TGGCCAGGGT CAATGTCTGT 

CCACCTTCGT CCCCACCCTC 

GGCCCCCACA GCCCCTGCCT 

GCACTCCAAG AATTGAAACC 
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GCCCAGGCTG GAGTGCAGGG 

900 

TCCTGGGTTC AAGTGTTTCT 

950 

ACAGGCACAC ACCACCGCGT 

1000 

TTCGCCATAT TGGCCAGGGT 

1050 

CCCACCTCGG CCTCCCAAAG 

1100 

CCGGCTGATC TCTTTTCTAT 

1150 

TAGGCTGGTC TTGAACTCCT 

1200 

CCAAAGTGTT GAGATTACAG 

1250 

ACTACAAAAG GAAGCCAGTA 

1300 

ACAATGGAGC ACACAGAACC 

1350 

CCTCTCCCCT CCCTCTGCCC 

1400 

CAGGGCCGGA GGAGGCAGTC 

1450 

TTACGAGGTC TCTTTTCTCC 

1500 

AGGCTGACTG TGCCCACCCC 

1550 

CTCTCTGGTC TCTCCAGAAG 

1600 

CAGAGGAATC TGAAACCGCA 

1650 

CTCCCAGAGC AGCAGTACCT 

1700 

CTCAGTCTGC TGCCCCTCCC 

1750 



CACCAGAATG TTTCTCTCCC ATTCTTACCC ACTCAAGGCC CTTTCAGTAG 
1751 1800 



CCCCTTGGAG TATTCTCTTC CTACATATCA GGGCAACTTC CAAACTCATC 

1801 1850 

ACCCTTCTGA GGGGTGGGGG AAAGACCCCC ACCACATCGG GGGAGCAGTC 

1851 1900 

CTCCAAGGAC TGGCCAGTCT CCAGATGCCC GTGCACACAG GAACACTGCC 

1901 1950 

TTATGCACGG GAGTCCCAGA AGAAGGGGTG ATTTCTTTCC CCACCTTAGT 

1951 2000 

TACACCATCA AGACCCAGCC AGGGCATCCC CCCTCCTGGC CTGAGGGCCA 

2001 2050 

GCTCCCCATC CTGAAAAACC TGTCTGCTCT CCCCACCCCT TTGAGGCTAT 

2051 2100 

AGGGCCCAAG GGGCAGGTTG GACTGGATTC CCCTCCAGCC CCTCCCGCCC 

2101 2150 

CCAGGACAAA ATCAGCCACC CCAGGGGCAG GGCCTCACTT GCCTCAGGAA 

2151 2200 

CCCCAGCCTG CCAGCACCTA TTCCACCTCC CAGCCCAGCA 

2201 2239 



Nucleotide sequence of a mouse uroplakin 11 5' flanking region. The 

translational start site is denoted with an asterisk. SEQ ID NO: (number 1 of SEQ 

ID N0:6 corresponds to position -3592 with respect to the translational start site). 



CTCGAGGATCTCGGCCCTCTTTCTGCATCCTTGTCCTAAATCATTTTCAT 
1 50 

ATCTTGCTAGACCTCAGTTTGAGAGAAACGAACCTTCTCATTTTCAAGTT 
51 100 

GAAAAAAAAAAGAGGTTCAAAGTGGCTCACTCAAAGTTACAAGCCAACAC 
101 150 

TCACCACTACGAGTACAATGGCCACCATTAGTGCTGGCATGCCCCAGGAG 
151 200 

ACAGGCATGCATATTATTCTAGATGACTGGGAGGCAGAGGGGTGGCCTAG 
201 250 
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TGAGGTCAGACTGTGGACAGATCAGGCAGATGTGGGTTCTGATCCCAATT 
251 300 

CCTCAGGCCGCAGAACTACTGTGGTTCAAGAAGGGGACAAAAGGACTGCA 
301 350 

GTCCGGAACAGGAGGTCCATTTGAGAGCTGAGTGAGCAGAAGAGGAAAGT 
351 400 

GAAGAACTTCTGGGGCAAGAGCTTACCCTACTTTACAGCTTTGTTGTCTT 
401 450 

CTTTACTCCAGGGGCGTCCCTGGTACTCAGTAAATGTCTGTTGGCTTGAG 
451 500 

GAACATATGTGTAAGGAGGAAGGAGAGGGAACTTGAGGGAGTTAAGACTC 
501 550 

AAGAATCAATCAAGGAGAGGACAGCAGAGAAGACAGGGTTTGGGAGAGAG 
551 600 

ACTCCAGACATTGGCCCTGGTTCCCTTCTTGGCCACTGTGAAACCCTCCA 
601 650 
GAGGAACTGAGTGCTGTGGCTTTAAATGATCTCAGCACTGTCAGTGAAGC 
651 700 

GCTCTGCTCAAAGAGTTATCCTCTTGCTCCTGTGCCGGGGCCTCCCCCTC 
701 750 

CTCTCAGCTCCCAAACCCTTCTCAGCCACTGTGATGGCATAATTAGATGC 
751 800 

GAGAGCTCAGACCGTCAGGTCTGCTCCAGGAACCACCCATTTTCCCCAAC 
801 850 

CCCAGAGAAAGGTCCTAGTGGAAAAGTGGGGGCCACTGAAGGGCTGATGG 
851 900 

GGTTCTGTCCTTTCCCCCATGCTGGGTGGACTTAAAGTCTGCGATGTGTG 
900 950 

TAGGGGGTAGAAGACAACAGAACCTGGGGGCTCCGGCTGGGAGCAGGAGG 
951 1000 

AACTCTCACCAGACGATCTCCAAATTTACTGTGCAATGGACGATCAGGAA, 
1001 1050 

ACTGGTTCAGATGTAGCTTCTGATACAGTGGGTCTGAGGTAAAACCCGAA 
1051 1100 
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ACTTAATTTCTTTCAAAAATTTAAAGTTGCATTTATTATTTTATATGTGT 
1101 1150 

GCCCATATGTGTGCCACAGTGTCTATGTGGAGGTCAGAGGGCAAGTTGTG 
1151 1200 

GGCATTGGCTCTCTCCTTTCATAATGTGGCTTCTGGGGACCAAAATGTCA 
1201 1250 

GGCATGGTGGCAAGAGCTTTTACCTGTTGAGCCATCTCATGGTTTCGTAA 
1251 1300 

AACTTCCTATGACGCTTACAGGTAACGCAGAGACACAGACTCACATTTGG 
1301 1350 

AGTTAGCAGATGCTGTATTGGTGTAAACACTCATACACAGACACACACAC 
1351^ 1400 

ATACTCATACACACACACACACACTTATCACATGCACACACATACTCGTA 
1401 1450 

TACACACAGACACACACACATGCACTCTCACATTCACATATTCATACACA 
1451 1500 

TCCACACACACACTCATCCACACACACAGACACACATACTCATCCACACA 
1501 1550 

CACACACACACATACTCATACACACACACAGACACACATACTCATACACA 
1551 1600 

CACACAGACACACACATATAATCATACATACACAGACACACTCATACATG 
1601 1650 

TGCACACACACACTCATCCACACACACACACTCATACACACACACACTCA 
1651 1700 

TACACACACACACTCATACACACACACACGAGGTTTTTCTCAGGCTGCCT 
1701 1750 

TTGGGTGGAGACTGGAACTGATTTCTGTTTTTCAGCTCCTTGGCTTTTTG 
1751 1800 

TCCCTTTAGATGAGATCTCCTCCTCACTTTACACACAGAAAGATCACACA 
1801 1850 

CGAGGGAGAACTGGCGGTGCGGAAGAGGGCTACACGGTAGGGTGTCAGGG 
1851 1900 

TCAGGAGATCTTCCTGGCAAGTCTCAAACCTCCACATAGCACAGTGTTTA 
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1901 



1950 



CGTGAGGATTTAGGAGGAATCAGGAAGAGGATTGGTTTACTGCAGAGCAG 
1951 2000 

5 

ACCATATAGGTCCACTCCTAAGCCCCATTTGAAATTAGAAGTGAGACAGT 
2001 2050 

GTGGGATAAAAAGAGCAGATCTCTGGTCACATTTTTAAAGGGATATGAGG 
10 2051 3000 

GTCCTGTGCCTTTAAGCCTTCCCATCTCCCTCCAATCCCCCCTCACCTTC 
2101 2150 

1 5 CCCACCCTAACCCTCCCCAGGTTTCTGGAGGAGCAGAGTTGCGTCTTCTC 

2151 2200 

CCTGCCCTGCCGAGCTGCTCACTGGCTGCTCTAGAGGCTGTGCTTTGCGG 
2201 2250 



20 



35 



50 



TCTCCATGGAAACCATTAGTTGCTAAGCAACTGGAGCATCATCTGTGCTG 
2251 2300 



AGCTCAGGTCCTATCGAGTTCACCTAGCTGAGACACCCACGCCCCTGCAG 
jlS 2301 2350 

CCACTTTGCAGTGACAAGCCTGAGTCTCAGGTTCTGCATCTATAAAAACG 
2351 2400 

lO AGTAGCCTTTCAGGAGGGCATGCAGAGCCCCCTGGCCAGCGTCTAGAGGA 
I 2401 2450 

i GAGGTGACTGAGTGGGGCCATGTCACTCGTCCATGGCTGGAGAACCTCCA 

2451 2500 



TCAGTCTCCCAGTTAGCCTGGGGCAGGAGAGAACCAGAGGAGCTGTGGCT 
2501 2550 



GCTGATTGGATGATTTACGTACCCAATCTGTTGTCCCAGGCATCGAACCC 
40 2551 2600 

CAGAGCGACCTGCACACATGCCACCGCTGCCCCGCCCTCCACCTCCTCTG 
2601 2650 

45 CTCCTGGTTACAGGATTGTTTTGTCTTGAAGGGTTTTGTTGTTGCTACTT 

2651 2700 

TTTGCTTTGTTTTTTCTTTTTTAACATAAGGTTTCTCTGTGTAGCCCTAG 
2701 2750 
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CTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCAAACTCAGAAATC 
2751 2800 



CACCTTCCTCCCAAGTGCTGGGATTAAAGGCATTCGCACCATCGCCCAGC 
5 2801 2850 

CCCCGGTCTTGTTTCCTAAGGTTTTCCTGCTTTACTCGCTACCCGTTGCA 
2851 2900 

10 CAACCGCTTGCTGTCCAAGTCTGTTTGTATCTACTCCACCGCCCACTAGC 

2901 2950 

CTTGCTGGACTGGACCTACGTTTACCTGGAAGCCTTCACTAACTTCCCTT 
2951 3000 



15 



ru30 



GTCTCCACCTTCTGGAGAAATCTGAAGGCTCACACTGATACCCTCCGCTT 
3001 3050 



CTCCCAGAGTCGCAGTTTCTTAGGCCTCAGTTAAATACCAGAATTGGATC 

S^aO 3051 3100 

W TCAGGCTCTGCTATCCCCACCCTACCTAACCAACCCCCTCCTCTCCCATC 

■'Z 3101 3150 

fiUS CTTACTAGCCAAAGCCCTTTCAACCCTTGGGGCTTTTCCTACACCTACAC 

%j 3151 3200 

|:| ACCAGGGCAATTTTAGAACTCATGGCTCTCCTAGAAAACGCCTACCTCCT 

hi 3201 3250 



TGGAGACTGACCCTCTACAGTCCAGGAGGCAGACACTCAGACAGAGGAAC 
3251 3300 



TCTGTCCTTCAGTCGCGGGAGTTCCAGAAAGAGCCATACTCCCCTGCAGA 
35 3301 3350 

GCTAACTAAGCTGCCAGGACCCAGCCAGAGCATCCCCCTTTAGCCGAGGG 
3351 3400 

40 CCAGCTCCCCAGAATGAAAAACCTGTCTGGGGCCCCTCCCTGAGGCTACA 

3401 3450 

GTCGCCAAGGGGCAAGTTGGACTGGATTCCCAGCAGCCCCTCCCACTCCG 
3451 3500 

45 

AGACAAAATCAGCTACCCTGGGGCAGGCCTCATTGGCCCCAGGAAACCCC 
3501 3550 
AGCCTGTCAGCACCTGTTCCAGGATCCAGTCCCAGCGCAGTA 
3551 

50 3592 
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AFP-TRE. SEQ ID N0:_. 

GCATTGCTGTGAACTCTGTACTTAGGACTAAACTTTGAGCAATAACACACATAGATTGAG 



GATTGTTTGCTGTTAGCATACAAACTCTGGTTCAAAGCTCCTCTTTATTGCTTGTCTTGG 

1 2 1 AAAATTTGCTGTTCTTCATGGTTTCTCTTTTCACTGCTATCTATTTTTCTCAACCACTCA 

181 CATGGCTACAATAACTGTCTGCAAGCTTATGATTCCCAAATATCTATCTCTAGCCTCAAT 

241 CTTGTTCCAGAAGATAAAAAGTAGTATTCAAATGCACATCAACGTCTCCACTTGGAGGGC 

3 01 TTAAAGACGTTTCAACATACAAACCGGGGAGTTTTGCCTGGAATGTTTCCTAAAATGTGT 

3 61 CCTGTAGCACATAGGGTCCTCTTGTTCCTTAAAATCTAATTACTTTTAGCCCAGTGCTCA 
421 TCCCACCTATGGGGAGATGAGAGTGAAAAGGGAGCCTGATTAATAATTACACTAAGTCAA 

4 8 1 TAGGCATAGAGCCAGGACTGTTTGGGTAAACTGGTCACTTTATCTTAAACTAAATATATC 
541 CAAAACTGAACATGTACTTAGTTACTAAGTCTTTGACTTTATCTCATTCATACCACTCAG 
601 CTTTATCCAGGCCACTTATGAGCTCTGTGTCCTTGAACATAAAATACAAATAACCGCTAT 
661 GCTGTTAATTATTGGCAAATGTCCCATTTTCAACCTAAGGAAATACCATAAAGTAACAGA 
721 TATACCAACAAAAGGTTACTAGTTAACAGGCATTGCCTGAAAAGAGTATAAAAGAATTTC 
781 AGCATGATTTTCCATATTGTGCTTCCACCACTGCCAATAACA (822) 



Probasin -TRE SEQ ID N0:_ 

-426 

5 ' -AAGCTTCGACAAGTGCATTTAGCCTCTCCAGTATTGCTGATGAATCCACAGT 

TCAGGTTCAATGGCGTTCAAAACTTGATCAAAAATGACCAGACTTTATATTTA 

CACCAACATCTATCTGATTGGAGGAATGGATAATAGTCATCATGTTTAAACAT 

PTarraTTPrAGTTAAGAAAATATGATAGC ATCTTGTTCTTAGT CTTTTTCTTA 

ARE-1 

ATAGGGACATAAAGCCCACAAATAAAAATATGCCTGAAGAATGGGACAGGC 

ATTGGGCATTGTCCATGCCT AGTAAAGTACTCCAAGAACCTATTT GTATACTA 

ARE- 2 

GATGACACAATGTCAATGTCTGTGTACAACTGCCAACTGGGATGCAAGACAC 

TGCCCATG CCAAT CATCCTGAAAAGCAGC TATAAAAA GCAGGAAGCTACTCT 
CAAT box TATAA box 

+ 1 +28 
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GCACCTTGTCAGTAGGTCCAGATACCTACAG - 3 ' 
Transcription site 



Tyrosinase-TRE SEQ ID N0:__ 

PinAlend ^i^^^* 



1 CCGGigiiSAAAATGATAAGTTGAATTCTGTCTTCGAGAACATAGAAAAGAA 
5 1 TTATGAAATGCCAAC ATGTGGTTACAAGTAATGCAGACCCAAGGCTCCCC 

10 101 AGGGAC AAGAAGTCTTGTGTTAACTCTTTGTGGCTCTGAAAGAAAGAGAG 

151 AGAGAAAAGATTAAGCCTCCTTGTGGAGATCATGTGATGACTTCCTGATT 



2 01 CCAGCCAGAGCGAGCATTTCCATGGAAACTTCTCTTCCTCTTOQCTCGAG 



— g-5± (SfTTACTAACCTTATTGTTAATATTCTAACCATAAGAATTAAACTATTAAT 

3 01 GGTGAATAGAGTTTTTCACTTTAACATAGGCCTATCCCACTGGTGGGATA 
15 3 51 CGAGCCAATTCGAAAGAAAAAGTCAGTCATGTGCTTTTCAGAGGATGAAA 

401 GCTTAAGATAAAGACTAAAAGTGTTTGATGCTGGAGGTGGGAGTGGTATT 

4 51 ATATAGGTCTCAGCCAAGACATGTGATAATCACTGTAGTAGTAGCTGGAA 

5 01 AGAGAAATCTGTGACTCCAATTAGCCAGTTCCTGCAGACCTTGTC^ + 

PinAI end 

20 Human glandular kallikrein-TRE SEQ ID N0:_ 



gaattcagaa ataggggaag gttgaggaag gacactgaac tcaaagggga tacagtgatt 60 

ggtttatttg tcttctcttc acaacattgg tgctggagga attcccaccc tgaggttatg 
25 120 

aagatgtctg aacacccaac acatagcact ggagatatga gctcgacaag agtttctcag 
180 

30 ccacagagat tcacagccta gggcaggagg acactgtacg ccaggcagaa tgacatggga 

240 

attgcgctca cgattggctt gaagaagcaa ggactgtggg aggtgggctt tgtagtaaca 
300 

35 

ag^gggcagg gtgaactctg attcccatgg gggaatgtga tggtcctgtt acaaattttt 
360 

caagctggca gggaataaaa cccattacgg tgaggacctg tggagggcgg ctgccccaac 
40 42 0 

tgataaagga aatagccagg tgggggcctt tcccattgta ggggggacat atctggcaat 
480 

45 agaagccttt gagacccttt agggtacaag tactgaggca gcaaataaaa tgaaatctta 

540 

tttttcaact ttatactgca tgggtgtgaa gatatatttg tttctgtaca gggggtgagg 

600 

50 
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gaaaggaggg gaggaggaaa gttcctgcag gtctggtttg gtcttgtgat ccagggggtc 
660 

ttggaactat ttaaattaaa ttaaattaaa acaagcgact gttttaaatt aaattaaatt 
5 720 

aaattaaatt ttactttatt ttatcttaag ttctgggcta catgtgcagg acgtgcagct 
780 

10 ttgttacata ggtaaacgtg tgccatggtg gtttgctgta cctatcaacc catcacctag 

840 

gtattaagcc cagcatgcat tagctgtttt tcctgacgct ctccctctcc ctgactccca 
900 

15 

caacaggccc cagtgtgtgt tgttcccctc cctgtgtcca tgtgttctca ttgttcagct 
960 

cccacttata agtgagaaca tgtggtgttt ggttttctgt ttctgtgtta gtttgctgag 
20 1020 

gataatggct tccacctcca tccatgttcc tgcaaaggac gtgatcttat tcttttttat 
1080 

tE5 ggttgcatag aaattgtttt tacaaatcca attgatattg tatttaatta caagttaatc 

0 1140 

r: taattagcat actagaagag attacagaag atattaggta cattgaatga ggaaatatat 

n 12 0 0 

30 

'I aaaataggac gaaggtgaaa tattaggtag gaaaagtata atagttgaaa gaagtaaaaa 

1260 

:| aaaatatgca tgagtagcag aatgtaaaag aggtgaagaa cgtaatagtg actttttaga 

il35 1320 

1 ccagattgaa ggacagagac agaaaaattt taaggaattg ctaaaccatg tgagtgttag 
-| 1380 

''40 aagtacagtc aataacatta aagcctcagg aggagaaaag aataggaaag gaggaaatat 

1440 

gtgaataaat agtagagaca tgtttgatgg attttaaaat atttgaaaga cctcacatca 
1500 

45 

aaggattcat accgtgccat tgaagaggaa gatggaaaag ccaagaagcc agatgaaagt 
1560 

tagaaatatt attggcaaag cttaaatgtt aaaagtccta gagagaaagg atggcagaaa 
50 1620 

tattggcggg aaagaatgca gaacctagaa tataaattca tcccaacagt ttggtagtgt 
1680 

55 gcagctgtag ccttttctag ataatacact attgtcatac atcgcttaag cgagtgtaaa 

1740 

atggtctcct cactttattt atttatatat ttatttagtt ttgagatgga gcctcgctct 
1800 

60 
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gtctcctagg ctggagtgca atagtgcgat accactcact gcaacctctg cctcctctgt 
1860 

tcaagtgatt ttcttacctc agcctcccga gtagctggga ttacaggtgc gtgccaccac 
5 1920 

acccggctaa tttttgtatt ttttgtagag acggggtttt gccatgttgg ccaggctggt 
1980 

10 cttgaactcc tgacatcagg tgatccacct gccttggcct cctaaagtgc tgggattaca 

2040 

ggcatgagcc accgtgccca accactttat ttatttttta tttttatttt taaatttcag 
2100 

15 

cttctatttg aaatacaggg ggcacatata taggattgtt acatgggtat attgaactca 
2160 

ggtagtgatc atactaccca acaggtaggt tttcaaccca ctccccctct tttcctcccc 
20 2220 

attctagtag tgtgcagtgt ctattgttct catgtttatg tctatgtgtg ctccaggttt 
2280 

l§5 agctcccacc tgtaagtgag aacgtgtggt atttgatttt ctgtccctgt gttaattcac 

HI 2340 

'^11 ttaggattat ggcttccagc tccattcata ttgctgtaaa ggatatgatt catttttcat 

2400 

Bo 

Ul ggccatgcag tattccatat tgcgtataga tcacattttc tttctttttt ttttttgaga 

2460 

ri cggagtcttg ctttgctgcc taggctggag tgcagtagca cgatctcggc tcactgcaag 

hB5 2520 

cttcacctcc ggggttcacg tcattcttct gtctcagctt cccaagtagc tgggactaca 
2580 

H40 ggcgcccgcc accacgtccg gctaattttt ttgtgtgttt ttagtagaga tgggggtttc 

2640 

actgtgttag ccaggatggt cttgatctcc tgaccttgtg gtccacctgc ctcggtctcc 
2700 

45 

caaagtgctg ggattacagg ggtgagccac tgcgcccggc ccatatatac cacattttct 
2760 

ttaaccaatc caccattgat gggcaactag gtagattcca tggattccac agttttgcta 
50 2820 

ttgtgtgcag tgtggcagta gacatatgaa tgaatgtgtc tttttggtat aatgatttgc 
2880 

55 attcctttgg gtatacagtc attaatagga gtgctgggtt gaacggtggc tctgtttaaa 

2940 

attctttgag aattttccaa actgtttgcc atagagagca aactaattta catttccacg 
3000 

60 
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aacagtatat aagcattccc ttttctccac 
3060 

tattttaaaa aagaatatgt tgttgttttc 
3120 

gttacatagg tagtaaacgt gagccatggt 
3180 

ggtatgaagc cctgcctgca ttagctcttt 
3240 

caccctgaca gggcaaacag acaacctaca 
3300 

tctgacaaag gtcaagaata tccagaatct 
3360 

aataatagcc actctgactg gcgtgaaatg 
3420 

tctgatgatc agtgacgatg agcatttttt 
3480 

agaagtgtct cttcatgcct tttggccact 
3540 

aagttcctta tagattctgg atattagact 

3600 

tcttccattc tgtaggttgt ctgtttactc 
3660 

atcttagttt aattagaaac cacctgccaa 
3720 

acttagtcat aaactctttg ccaaggtctg 
3780 

ctagaatttt gaaagtctga atgtaaacat 
3840 

ttgtatatgt gaaaggtcta ctctcatttt 
3900 

tttctttctt tctttctttc tttctttctt 
3960 

tttctttctt tctctttctt tctctctttc 
4020 

tgcccaggct gcagtgcagc ggcacgatct 
4080 

caactgattc tcctgcatca gccttccaag 
4140 

cccgactaat ttttgtattt ttagtagaga 
4200 



agctttgtca tcatggtttt tttttttctt 
ccagggtaca tgtgcaggat gtgcaggttt 
ggtttgctgc acctgtcaac ccattacctg 
tccctaatgc tctcactact gccccaccct 
gaatgggagg aaatttttgc aatctattca 
acaaggaact taagcaaatt tttacttttt 
gtatctcatt gtggttttca tttgaatttc 
catatttgtt ggctgcttgt acgtcttttg 
ttaatgggat tattttttgc tttttagttt 
tcttattgga tgcatagttt gtgaatactc 
tattgatggc ttcttttgct gtgccgaagc 
tttttgtttt tgttgcaatt gcttttgggg 
ggtcaagaag agtatttcct aggttttctt 
ttgcattttt aatgcatctt gagttagttt 
ctttccctct ttctttcttt ctttcttttc 
tctttctttc tttctttttg tccttctttc 
tttttttttt ttgatggagt attgctctgt 
cggctcactg caacctctgc ctcctgggtt 
tagctgggat tataggcgcc cgccaccacg 
cggggttgtg ccatgttggc caggctggtt 
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tgaaactcct gacctcaaac 
4260 

gtgtgagcca ctgtgcccag 
4320 

tattttggga ccttgagaag 
4380 

aaatctttgg catggcttgg 
4440 

tttataaaag ctccagctaa 
4500 

gctatacttt acacaaataa 
4560 

aaattggtcc tgctatgatt 
4620 

ttgcatctga cctttttttc 
4680 

attattttct ggttgcaaaa 
4740 

acatttatct ggctctttac 
4800 

cttacagttc tactcttcaa 
4860 

aaaactgaag ccatggtatt 
4920 

tcatatgcaa tccactgggc 
4980 

atacatttaa aaccctcact 
5040 

attgtaagag ccagattcgg 
5100 

agaaggtaaa gcaacctatc 
5160 

gttctaggaa gactgacagt 
5220 

atccccagtg ttcggaggtg 
5280 

cttctttgtg gtgttacagt 
5340 

tgtgtggccc ctcccctccc 
5400 



gatctgcctg ccttggcctc 
ccaagaatgt cattttctaa 
agsggaattc atacaggtat 
cttcaagact ttaggctctt 
gctaccttaa aaggggcctg 
acaggccaaa tataatgagg 
tactcttggt aagaacaggg 
tgaattttta tatgtgccta 
actctctaaa gaagaacttg 
tagaacagct ttcttgtttt 
attattgtta tgtgtatctc 
ctgaggacta gagatgactc 
tcgatctgct tcaaattgct 
aaaggatcag ggaccatcat 
ggggtagagt gtggaggtca 
ctgaaagcta acctgccatg 
ttgggtctgt gtcattgccc 
ggacttggtg gtaggtgatt 
gatagtgagt gagttctcgt 
tctcttggtc ctcctactgc 



ccaaagtgct gggattacag 
gaggtccaag aacctcaaga 
tacaagcaca gcctaatggc 
aaaagtcgaa tccaaaaatt 
tatggctgat cactcttctt 
ccaaaattta ttttgcaaat 
aaaatagaga aaaatttaga 
caatttgagc taaatcctga 
gttttcattg tcttcgtgac 
tggtgttcta gcttgtgtgc 
atagttttcc ttcttttgag 
aacagagctg gtgaatctcc 
gatgcactgc tgctaaagct 
ggaagaggag gaaacatgaa 
gagcaactcc accttgaata 
gtggcttctg attaacctct 
aaatctcatg ttaaattgta 
cggtcatggg agtagatttt 
gagatctggt catttaaaag 
catgtaagat acctgctcct 
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gctttgcctt ctaccataag taaaagcccc ctgaggcctc cccagaagca gatgccacca 
5460 

tgcttcctgt acagcctgca gaaccatcag ccaattaaac ctcttttctg tataaattac 
5520 

cagtcttgag tatctcttta cagcagtgtg agaacggact aatacaaggg tctccaaaat 
5580 

tccaagttta tgtattcttt cttgccaaat agcaggtatt taccataaat cctgtcctta 
5640 

ggtcaaacaa ccttgatggc atcgtacttc aattgtctta cacattcctt ctgaatgact 
5700 

cctcccctat ggcatataag ccctgggtct tgggggataa tggcagaggg gtccaccatc 
5760 

ttgtctggct gccacctgag acacggacat ggcttctgtt ggtaagtctc tattaaatgt 
5820 

ttctttctaa gaaactggat ttgtcagctt gtttctttgg cctctcagct tcctcagact 
5880 

ttggggtagg ttgcacaacc ctgcccacca cgaaacaaat gtttaatatg ataaatatgg 
5940 

atagatataa tccacataaa taaaagctct tggagggccc tcaataattg ttaagagtgt 
6000 

aaatgtgtcc aaagatggaa aatgtttgag aactactgtc ccagagattt tcctgagttc 

6060 

tagagtgtgg gaatatagaa cctggagctt ggcttcttca gcctagaatc aggagtatgg 
6120 

ggctgaagtc tgaagcttgg cttcagcagt ttggggttgg cttccggagc acatatttga 
6180 

catgttgcga ctgtgatttg gggtttggta tttgctctga atcctaatgt ctgtccttga 
6240 

ggcatctaga atctgaaatc tgtggtcaga attctattat cttgagtagg acatctccag 
6300 

tcctggttct gccttctagg gctggagtct gtagtcagtg acccggtctg gcatttcaac 
6360 

ttcatataca gtgggctatc ttttggtcca tgtttcaacc aaacaaccga ataaaccatt 
6420 

agaacctttc cccacttccc tagctgcaat gttaaaccta ggatttctgt ttaataggtt 
6480 

catatgaata atttcagcct gatccaactt tacattcctt ctaccgttat tctacaccca 
6540 

ccttaaaaat gcattcccaa tatattccct ggattctacc tatatatggt aatcctggct 
6600 
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ttgccagttt ctagtgcatt 
6660 

aagagtccct ctgcagagtt 
6720 

agatttcaag ggagtcgcca 
6780 

taactcaatg aaagctgtta 
6840 

gaaaatctag caagggaaga 
6900 

attcctgttg ttttctccca 
6960 

tgtgataata ttcagtgtat 
7020 

gcttggttgc acagacatgt 
7080 

agacgtctac agctgatagg 
7140 

tatccagtgt ggcccaagct 
7200 

aaaagcttag agatgacctc 
7260 

agaatccttt accgcacact 
7320 

tgtgtgagac accagcacat 

7380 

ctgagtgcta agaaagtatt 

7440 

cacaacatgg ctacagggtg 
7500 

taagaagggg gtgaccaata 
7560 

tgattagaca ttgtctgcag 
7620 

gcactttcct tgacccccta 
7680 

tgataccagg agcccatcat 
7740 

cactcttccc agctccatct 
7800 



aacatacctg atttacattc 
caggagttct caagatggcc 
agatcatcct caggttcagt 
tgctcatggc tatggtttat 
gttgcatggg gcaaagacaa 
gtggtgtcat ggaaagcagt 
tgccaatcag ggaactcaac 
cgaccacctt catggctgaa 
ctgtaaccca acattgtcac 
cccgtgtaaa cacaggcact 
ccaggagctg aatgcaaaga 
ctccttcaca gggttattgt 
gtctggctgt ggagagtgac 
aggcatggct ttcagcactc 
ggcactacta gcctcatttg 
ggtcagagtc attctggatg 
agaaattatg gctggatgtc 
tctcagatct tgactttgag 
aatctctctg tgtcctctcc 
ccagctggcc aggtgtagcc 



ttttacttta aagtggaaat 
cttacttctg acatcaattg 
gattgctggt agccctcata 
tacagcaaaa gaatagagat 
ggagagctcc aagtgcagag 
atcttctcca tacaatgatg 
tgagccttga ttatattgga 
ctttagtact tagcccctcc 
cataaatcac attgttagac 
ctaaacaggc aggatatttc 
cctggcctct ttgggcaagg 
gaggatcaaa cgtggtcatg 
ttctatgtgt gctaacattg 
acagatgctc atctaatcct 
acagaggaaa ggactgtgga 
caaggggctc cagaggacca 
tctgccccgg aaagggggat 
gttatctcag acttcctcta 
ccttcctcag tcttactgcc 
acagtaccta actctttgca 
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gagaactata aatgtgtatc ctacagggga gaaaaaaaaa aagaactctg aaagagctga 
7860 

cattttaccg acttgcaaac acataagcta acctgccagt tttgtgctgg tagaactcat 
7920 

gagactcctg ggtcagaggc aaaagatttt attacccaca gctaaggagg cagcatgaac 
7980 

tttgtgttca catttgttca ctttgccccc caattcatat gggatgatca gagcagttca 
8040 

ggtggatgga cacaggggtt tgtggcaaag gtgagcaacc taggcttaga aatcctcaat 
8100 

cttataagaa ggtactagca aacttgtcca gtctttgtat ctgacggaga tattatcttt 
8160 

ataattgggt tgaaagcaga cctactctgg aggaacatat tgtatttatt gtcctgaaca 
8220 

gtaaacaaat ctgctgtaaa atagacgtta actttattat ctaaggcagt aagcaaacct 
8280 

agatctgaag gcgataccat cttgcaaggc tatctgctgt acaaatatgc ttgaaaagat 
8340 

ggtccagaaa agaaaacggt attattgcct ttgctcagaa gacacacaga aacataagag 
8400 

aaccatggaa aattgtctcc caacactgtt cacccagagc cttccactct tgtctgcagg 
8460 

acagtcttaa catcccatca ttagtgtgtc taccacatct ggcttcaccg tgcctaacca 
8520 

agatttctag gtccagttcc ccaccatgtt tggcagtgcc ccactgccaa ccccagaata 
8580 

agggagtgct cagaattccg aggggacatg ggtggggatc agaacttctg ggcttgagtg 
8640 

cagagggggc ccatactcct tggttccgaa ggaggaagag gctggaggtg aatgtccttg 
8700 

gaggggagga atgtgggttc tgaactctta aatccccaag ggaggagact ggtaaggtcc 
8760 

cagcttccga ggtactgacg tgggaatggc ctgagaggtc taagaatccc gtatcctcgg 
8820 

gaaggagggg ctgaaattgt gaggggttga gttgcagggg tttgttagct tgagactcct 
8880 

tggtgggtcc ctgggaagca aggactggaa ccattggctc cagggtttgg tgtgaaggta 
8940 

atgggatctc ctgattctca aagggtcaga ggactgagag ttgcccatgc tttgatcttt 
9000 
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ccatctactc cttactccac ttgagggtaa tcacctactc ttctagttcc acaagagtgc 
9060 

gcctgcgcga gtataatctg cacatgtgcc atgtcccgag gcctggggca tcatccactc 
5 9120 

atcattcagc atctgcgcta tgcgggcgag gccggcgcca tgacgtcatg tagctgcgac 
9180 

10 tatccctgca gcgcgcctct cccgtcacgt cccaaccatg gagctgtgga cgtgcgtccc 

9240 



15 



30 
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ctggtggatg tggcctgcgt ggtgccaggc cggggcctgg tgtccgataa agatcctaga 
9300 

accacaggaa accaggactg aaaggtgcta gagaatggcc atatgtcgct gtccatgaaa 
9360 



tctcaaggac ttctgggtgg agggcacagg agcctgaact tacgggtttg ccccagtcca 
20 9420 

ctgtcctccc aagtgagtct cccagatacg aggcactgtg ccagcatcag cttcatctgt 
9480 

25 accacatctt gtaacaggga ctacccagga ccctgatgaa caccatggtg tgtgcaggaa 

9540 

gagggggtga aggcatggac tcctgtgtgg tcagagccca gagggggcca tgacgggtgg 
9600 



9g^9g^99ct gtggactggc tcgagaagtg ggatgtggtt gtgtttgatt tcctttggcc 
9660 



agataaagtg ctggatatag cattgaaaac ggagtatgaa gaccagttag aatggagggt 
35 9720 

caggttggag ttgagttaca gatggggtaa aattctgctt cggatgagtt tggggattgg 
9780 

40 caatctaaag gtggtttggg atggcatggc tttgggatgg aaataggttt gtttttatgt 

9840 



tggctgggaa gggtgtgggg attgaattgg ggatgaagta ggtttagttt tggagataga 
9900 

atacatggag ctggctattg catgcgagga tgtgcattag tttggtttga tctttaaata 
9960 



aaggaggcta ttagggttgt cttgaattag attaagttgt gttgggttga tgggttgggc 
50 10020 

ttgtgggtga tgtggttgga ttgggctgtg ttaaattggt ttgggtcagg ttttggttga 
10080 

55 ggttatcatg gggatgagga tatgcttggg acatggattc aggtggttct cattcaagct 

10140 



gaggcaaatt tcctttcaga cggtcattcc agggaacgag tggttgtgtg ggggaaatca 
10200 



60 
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ggccactggc tgtgaatatc cctctatcct ggtcttgaat tgtgattatc tatgtccatt 
10260 



ctgtctcctt cactgtactt ggaattgatc tggtcattca gctggaaatg ggggaagatt 
5 10320 

ttgtcaaatt cttgagacac agctgggtct ggatcagcgt aagccttcct tctggtttta 
10380 

10 ttgaacagat gaaatcacat tttttttttc aaaatcacag aaatcttata gagttaacag 

10440 



15 



So 



45 



tggactctta taataagagt taacaccagg actcttattc ttgattcttt tctgagacac 
10500 

caaaatgaga tttctcaatg ccaccctaat tctttttttt tttttttttt tttttgagac 
10560 



acagtctggg tcttttgctc tgtcactcag gctggagcgc agtggtgtga tcatagctca 
20 10620 

ctgaaccctt gacctcctgg acttaaggga tcctcctgct tcagcctcct gagtagatgg 
10680 

vfl25 ggctacaggt gcttgccacc acacctggct aattaaattt tttttttttt tttgtagaga 

Co 1074 0 



aagggtctca ctttgttgcc ctggctgatc ttgaacttct gacttcaagt gattcttcag 
10800 

ccttggactc ccaaagcact gggattgctg gcatgagcca ctcaccgtgc ctggcttgca 
10860 



gcttaatctt ggagtgtata aacctggctc ctgatagcta gacatttcag tgagaaggag 
U35 10920 

=;r gcattggatt ttgcatgagg acaattctga cctaggaggg caggtcaaca ggaatccccg 

3 10980 

40 ctgtacctgt acgttgtaca ggcatggaga atgaggagtg aggaggccgt accggaaccc 

11040 



catattgttt agtggacatt ggattttgaa ataataggga acttggtctg ggagagtcat 
11100 

atttctggat tggacaatat gtggtatcac aaggttttat gatgagggag aaatgtatgt 
11160 



ggggaaccat tttctgagtg tggaagtgca agaatcagag agtagctgaa tgccaacgct 
50 11220 

tctatttcag gaacatggta agttggaggt ccagctctcg ggctcagacg ggtataggga 
11280 

55 ccaggaagtc tcacaatccg atcattctga tatttcaggg catattaggt ttggggtgca 

11340 



aaggaagtac ttgggactta ggcacatgag actttgtatt gaaaatcaat gattggggct 
11400 



60 
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ggccgtggtg ctcacgcctg taatctcatc actttgggag accgaagtgg gaggatggct 
11460 

tgatctcaag agttggacac cagcctaggc aacatggcca gaccctctct ctacaaaaaa 
5 11520 

attaaaaatt agctggatgt ggtggtgcat gcttgtggtc tcagctatcc tggaggctga 
11580 

10 gacaggagaa tcggttgagt ctgggagttc aaggctacag ggagctgcga tcacgccgct 

11640 

gcactccagc ctgggaaaca gagtgagact gtctcagaat ttttttaaaa aagaatcagt 
11700 

IS 

gatcatccca acccctgttg ctgttcatcc tgagcctgcc ttctctggct ttgttcccta 
11760 

gatcacatct ccatgatcca taggccctgc ccaatctgac ctcacaccgt gggaatgcct 
20 11820 

ccagactgat ctagtatgtg tggaacagca agtgctggct ctccctcccc ttccacagct 
... 11880 

UI25 ctgggtgtgg gagggggttg tccagcctcc agcagcatgg ggagggcctt ggtcagcatc 

Hi 11940 

4^ taggtgccaa cagggcaagg gcggggtcct ggagaatgaa ggctttatag ggctcctcag 

hi 12000 

|f|30 

\l ggaggccccc cagccccaaa ctgcaccacc tggccgtgga caccggt 

12047 



SI 35 

h HRE-TRE SEQ ID N0:_ 

ccccgagg cagtgcat gaggctcagg gcgtgcgt gagtcgcagcgagaccccg gggtgcag 

gccgga 

40 

PSA-TRE SEQ ID N0:_ 

aagcttctag ttttcttttc ccggtgacat cgtggaaagc actagcatct ctaagcaatg 60 

45 atctgtgaca atattcacag tgtaatgcca tccagggaac tcaactgagc cttgatgtcc 

120 

agagattttt gtgttttttt ctgagactga gtctcgctct gtgccaggct ggagtgcagt 
180 

50 

ggtgcaacct tggctcactg caagctccgc ctcctgggtt cacgccattc tcctgcctca 
240 

gcctcctgag tagctgggac tacaggcacc cgccaccacg cctggctaat ttttttgtat 
55 300 
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ttttagtaga gatggggttt 

360 

gatctgccca ccttggcctc 
420 

ccgatatcca gagatttttt 
480 

ttgactttta gtatccagcc 
540 

cagcacaaat cacaccgtta 
600 

actctaatct ggcaggatac 
660 

tggaaaagaa aaagaaagaa 
720 

gaggggaaac gcctgaggtc 
780 

agggtcattg tgacgatcaa 
840 

tggggagtgc cgtgtaagtg 
900 

atcttcagca cttacagatg 
960 

ctggcctcat ttgatggaga 
1020 

gacactctgg atgctgggga 
1080 

attgtggcct gatgtccctg 
1140 

cttgaccctc tcttttaggg 
1200 

ttctgtaccc tcttgactct 
1260 

tatctctatt cccagctggc 
1320 

aaggggctga cattttactg 
1380 

tgctggcaga gtccatgaga 
1440 

agcagacagc atgaggttca 
1500 



cactgtgtta gccaggatgg 
ccaaagtgct gggatgacag 
ggggggctcG atcacacaga 
cctctagaaa tctagctgat 
gactatctgg tgtggcccaa 
tccaaagcat tagagatgac 
aggaaaaaaa aaaaaaaaaa 
tttgagcaag gtcagtcctc 
atgtggtcac gtgtatgagg 
tatgcttgca ctgctgaatg 
ctcatctcat cctcacagca 
aagtggctgt ggctcagaaa 
ctccagagac catgaccact 
tcctggagag ggtggaggtg 
ctctttctga cctccaccat 
atgaccccca ccgcccactg 
cagtgcagtc tcagtgccca 
acttgcaaac aaataagcta 
ctcctgagtc agaggcaaag 
tgttcacatt agtacacctt 



tctcagtctc ctgacctcgt 
gcgtgagcca ccgcgcctgg 
catgttgact gtcttcatgg 
atagtgtggc tcaaaacctt 
accttcaggt gaacaaaggg 
ctcttgcaaa gaaaaagaaa 
gagatgacct ctcaggctct 
tgttgcacag tctccctcac 
caccagcaca tgcctggctc 
gctgggatgt gtcagggatt 
tcactatggg atgggtatta 
ggggggacca ctagaccagg 
caccaactgc agagaaatta 
gaccttcact aacctcctac 
ggtactagga ccccattgta 
catccagctg ggtcccctcc 
cctgtttgtc agtaactctg 
actttccaga gttttgtgaa 
gcttttactg ctcacagctt 
gcccccccca aatcttgtag 
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ggtgaccaga gcagtctagg tggatgctgt gcagaagggg tttgtgccac tggtgagaaa 
1560 

cctgagatta ggaatcctca atcttatact gggacaactt gcaaacctgc tcagcctttg 
5 1620 

tctctgatga agatattatc ttcatgatct tggattgaaa acagacctac tctggaggaa 
1680 

10 catattgtat cgattgtcct tgacagtaaa caaatctgtt gtaagagaca ttatctttat 

1740 

tatctaggac agtaagcaag cctggatctg agagagatat catcttgcaa ggatgcctgc 
1800 

15 

tttacaaaca tccttgaaac aacaatccag aaaaaaaaag gtgttactgt ctttgctcag 
1860 

aagacacaca gatacgtgac agaaccatgg agaattgcct cccaacgctg ttcagccaga 
20 1920 

gccttccacc ctttctgcag gacagtctca acgttccacc attaaatact tcttctatca 
1980 

catcccgctt ctttatgcct aaccaaggtt ctaggtcccg atcgactgtg tctggcagca 
CO 204 0 

ctccactgcc aaacccagaa taaggcagcg ctcaggatcc cgaaggggca tggctgggga 

r% 2100 

yi tcagaacttc tgggtttgag tgaggagtgg gtccaccctc ttgaatttca aaggaggaag 

^''l 2160 

O aggctggatg tgaaggtact gggggaggga aagtgtcagt tccgaactct taggtcaatg 

m35 2220 

agggaggaga ctggtaaggt cccagctccc gaggtactga tgtgggaatg gcctaagaat 
J.J 2280 

^"'40 ctcatatcct caggaagaag gtgctggaat cctgaggggt agagttctgg gtatatttgt 

2340 

ggcttaaggc tctttggccc ctgaaggcag aggctggaac cattaggtcc agggtttggg 
2400 

45 

gtgatagtaa tgggatctct tgattcctca agagtctgag gatcgagggt tgcccattct 
2460 

tccatcttgc cacctaatcc ttactccact tgagggtatc accagccctt ctagctccat 
50 2520 

gaaggtcccc tgggcaagca caatctgagc atgaaagatg ccccagaggc cttgggtgtc 
2580 

55 atccactcat catccagcat cacactctga gggtgtggcc agcaccatga cgtcatgttg 

2640 

ctgtgactat ccctgcagcg tgcctctcca gccacctgcc aaccgtagag ctgcccatcc 
2700 

60 
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tcctctggtg ggagtggcct gcatggtgcc aggctgaggc ctagtgtcag acagggagcc 
2760 



tggaatcata gggatccagg actcaaaagt gctagagaat ggccatatgt caccatccat 
5 2820 

gaaatctcaa gggcttctgg gtggagggca cagggacctg aacttatggt ttcccaagtc 
2880 

10 tattgctctc ccaagtgagt ctcccagata cgaggcactg tgccagcatc agccttatct 

2940 



15 



mo 



45 



ccaccacatc ttgtaaaagg actacccagg gccctgatga acaccatggt gtgtacagga 
3000 

gtagggggtg gaggcacgga ctcctgtgag gtcacagcca agggagcatc atcatgggtg 
3060 



999^99^99^ aatggacagg cttgagaacg gggatgtggt tgtatttggt tttctttggt 
20 3120 

tagataaagt gctgggtata ggattgagag tggagtatga agaccagtta ggatggagga 
3180 

125 tcagattgga gttgggttag ataaagtgct gggtatagga ttgagagtgg agtatgaaga 

1 3240 



ccagttagga tggaggatca gattggagtt gggttagaga tggggtaaaa ttgtgctccg 
3300 

gatgagtttg ggattgacac tgtggaggtg gtttgggatg gcatggcttt gggatggaaa 
3360 



tagatttgtt ttgatgttgg ctcagacatc cttggggatt gaactgggga tgaagctggg 
W35 342 0 

tttgattttg gaggtagaag acgtggaagt agctgtcaga tttgacagtg gccatgagtt 
d 3480 

40 ttgtttgatg gggaatcaaa caatggggga agacataagg gttggcttgt taggttaagt 

3540 



tgcgttgggt tgatggggtc ggggctgtgt ataatgcagt tggattggtt tgtattaaat 
3600 

tgggttgggt caggttttgg ttgaggatga gttgaggata tgcttgggga caccggatcc 
3660 



atgaggttct cactggagtg gagacaaact tcctttccag gatgaatcca gggaagcctt 
50 3720 

aattcacgtg taggggaggt caggccactg gctaagtata tccttccact ccagctctaa 
3780 

55 gatggtctta aattgtgatt atctatatcc acttctgtct ccctcactgt gcttggagtt 

3840 



60 



tacctgatca ctcaactaga aacaggggaa gattttatca aattcttttt tttttttttt 
3900 
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tttttttgag acagagtctc actctgttgc ccaggctgga gtgcagtggc gcagtctcgg 
3960 



ctcactgcaa cctctgcctc ccaggttcaa gtgattctcc tgcctcagcc tcctgagttg 
5 4020 

ctgggattac aggcatgcag caccatgccc agctaatttt tgtattttta gtagagatgg 
4080 

10 ggtttcacca atgtttgcca ggctggcctc gaactcctga cctggtgatc cacctgcctc 

4140 



15 



45 



agcctcccaa agtgctggga ttacaggcgt cagccaccgc gcccaqccac ttttgtcaaa 
4200 

ttcttgagac acagctcggg ctggatcaag tgagctactc tggttttatt gaacagctga 
4260 



aataaccaac tttttggaaa ttgatgaaat cttacggagt taacagtgga ggtaccaggg 
20 4320 

ctcttaagag ttcccgattc tcttctgaga ctacaaattg tgattttgca tgccacctta 
43 8 0 

IM5 atcttttttt tttttttttt aaatcgaggt ttcagtctca ttctatttcc caggctggag 

:0 444 0 



ttcaatagcg tgatcacagc tcactgtagc cttgaactcc tggccttaag agattctcct 
4500 

gcttcggtct cccaatagct aagactacag tagtccacca ccatatccag ataattttta 
4560 



aattttttgg ggggccgggc acagtggctc acgcctgtaa tcccaacacc atgggaggct 
iy35 4620 

M gagatgggtg gatcacgagg tcaggagttt gagaccagcc tgaccaacat ggtgaaactc 

O ^680 

40 tgtctctact aaaaaaaaaa aaaatagaaa aattagccgg gcgtggtggc acacggcacc 

4740 



tgtaatccca gctactgagg aggctgaggc aggagaatca cttgaaccca gaaggcagag 
4800 

gttgcaatga gccgagattg cgccactgca ctccagcctg ggtgacagag tgagactctg 
4860 



tctcaaaaaa aaaaaatttt tttttttttt ttgtagagat ggatcttgct ttgtttctct 
50 4920 

ggttggcctt gaactcctgg cttcaagtga tcctcctacc ttggcctcgg aaagtgttgg 
4980 

55 gattacaggc gtgagccacc atgactgacc tgtcgttaat cttgaggtac ataaacctgg 

5040 



60 



ctcctaaagg ctaaaggcta aatatttgtt ggagaagggg cattggattt tgcatgagga 
5100 
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tgattctgac ctgggagggc aggtcagcag gcatctctgt tgcacagata gagtgtacag 
5160 

gtctggagaa caaggagtgg ggggttattg gaattccaca ttgtttgctg cacgttggat 
5 5220 

tttgaaatgc tagggaactt tgggagactc atatttctgg gctagaggat ctgtggacca 
5280 

10 caagatcttt ttatgatgac agtagcaatg tatctgtgga gctggattct gggttgggag 

5340 

tgcaaggaaa agaatgtact aaatgccaag acatctattt caggagcatg aggaataaaa 
5400 

15 

gttctagttt ctggtctcag agtggtgcat ggatcaggga gtctcacaat ctcctgagtg 
5460 

ctggtgtctt agggcacact gggtcttgga gtgcaaagga tctaggcacg tgaggctttg 
20 5520 

tatgaagaat cggggatcgt acccaccccc tgtttctgtt tcatcctggg catgtctcct 
5580 

^^^25 ctgcctttgt cccctagatg aagtctccat gagctacaag ggcctggtgc atccagggtg 

Co 5640 

atctagtaat tgcagaacag caagtgctag ctctccctcc ccttccacag ctctgggtgt 
Ul 5700 

iPo 

gggagggggt tgtccagcct ccagcagcat ggggagggcc ttggtcagcc tctgggtgcc 
5760 

H agcagggcag gggcggagtc ctggggaatg aaggttttat agggctcctg ggggaggctc 

1^35 5820 

cccagcccca agctt 
O 5835 

40 
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CEATRE SBQIDNO: 

aagcttttta gtgctttaga cagtgagctg gtctgtctaa cccaagtgac ctcggctcca 60 

tactcagccc cagaagtgaa gggtgaagct gggtggagcc aaaccaggca agcctaccct 120 

cagggctccc agtggcctga gaaccattgg acccaggacc cattacttct agggtaagga . 180 

aggtacaaac accagatcca accatggtct ggggggacag ctgtcaaatg cctaeaaata 240 

tacctgggag aggagcaggc aaactatcac tgccccaggt tctctgaaca gaaacagagg 300 

ggcaacccaa agtccaaatc caggtgagca ggtgcaccaa atgcccagag atatgacgag 360 

gcaagaagtg aaggaaccac ccctgcatca aatgttttgc atgggaagga gaagggggtt 420 

tSfctcatgttc ccaatccagg agaatgcatt tgggatctgc cttcttctrca ctccttggtt 480 

X^agcaagacta agcaaccagg actctggatt tggggaaaga cgtttatttg tggaggccag 540 

P tgatgacaat cccacgaggg cctaggtgaa gagggcagga aggctcgaga cactggggac 600 - 

^ftgagtgaaaa ccacacccat gatctgcacc acccatggat gctccttcat tgctcacctt 660 

tctgttgata tcagatggcc ccattttctg taccttcaca gaaggacaca ggctagggtc 720 

I tgtgcatggc cttcatcccc ggggccatgt gaggacagca ggtgggaaag atcatgggtc 780 

I ctcctgggtc ctgcagggcc agaacattca tcacccatac tgacctccta gatgggaatg 840 

; gcttccctgg ggctgggcca acggggcctg ggcaggggag aaaggacgtc aggggacagg 900 

gaggaagggt catcgagacc cagcctggaa ggttcttgtc tctgaccatc caggetttac 960 

ttccctgcat ctacctttgg tcattttccc tcagcaatga ccagctctgc ttcctgatct 1020 

cagcctccca ccctggacac agcaccccag tccctggccc ggctgcatcc acccaatacc 1080 

ctgataaccc aggacccatt acttctaggg taaggagggt ccaggagaca gaagctgagg 1140 

aaaggtctga agaagtcaca tctgtcctgg ccagagggga aaaaccatca gatgctgaac 1200 

oe^qgaqaatg ttgacccagg aaagggaccg aggacccaag aaaggagtca gaccaccagg 1260 

gtttgcctga gaggaaggat caaggccccg agggaaagca gggctggctg catgtgcagg 1320 

acactggtgg ggcatatgtg tcttagattc tccctgaatt cagtgtccct gccatggcca 1380 

gactctctac tcaggcctgg acatgctgaa ataggacaat ggccttgtcc tctctcccca 1440 

ccatttggca agagacataa aggacattcc aggacatgcc ttcctgggag gtccaggttc 1500 

tctgtctcac acctcaggga ctgtagttac tgcatcagcc atggtaggtg ctgatctcac 1560 

ccagcctgtc caggcccttc cactctccac tttgtgacca tgtccaggac cacccctcag 1620 

atcctgagcc tgcaaatacc cccttgctgg gtgggtggat tcagtaaaca gtgagctcct 1680 
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etccagcecc-oagagccacc tctgtcacct t<;ctgctggg catcatccca ccttcacaag 1740 

cactaaagag catggggaga cctggctagc tgggtttctg catcacaaag aaaataatcc 1800 

cccaggttcg gattcccagg gctctgtatg tggagctgac agacctgagg ccaggagata 1860 

gcacaggtca gccctaggga gggtgggtca tccacccag'g ggacaggggt gcaccagcct 192D 

tgctactgaa agggcctccc caggacagcg ccatcagccc tgcctgagag ctttgctaaa 1980 

cagcagtcag aggaggccat ggcagtggct gagctcctgc tccaggcccc aacagaccag 2040 

accaacagca caatgcagtc cttccccaac gtcacaggtc accaaaggga aactgaggtg 2100 

ctacctaacc ttagagccat caggggagat aacagcccaa tttcccaaac aggccagttt 2160 

p^atcccatg acaatgacct ctctgctctc attcttccca aaataggacg ctgattctcc 2220 

gpcaccatgg atttctccct tgtcccggga gccttttctg ccccctatga tctgggcact 2280 

Ig^tgacacac acctcctctc tggtgacata tcagggtccc tcactgtcaa gcagtccaga 2340 

J^aggacagaa ccttggacag cgcccatctc agcttcaccc ttcctccttc acagggttca 2400 

"gggcaaagaa taaatggcag aggccagtga gcccagagat ggtgacaggc agtgacccag 2460 

rgggcagatgc ctggagcagg agctggcggg gccacaggga gaaggtcatg caggaaggga 2520 

n^acccagaaa tgggcaggaa aggaggacac aggctctgtg gggctgcagc ccagggttgg 2580 

Lectargagtg tgaagccatc tcagcaagta aggccaggtc ccatgaacaa gagtgggagc 2640 

acgtggcttc ctgctctgta tatggggtgg gggattccat gccccataga accagatggc 2700 

<^99gttcag atggagaagg agcaggacag gggatcccca ggataggagg accc<:agtgt 2760 

ccccacccag gcaggtgact gatgaatggg catgcagggt cctcctgggc tgggctctcc 2820 

ctttgtccct caggattcct tgaaggaaca tccggaagcc gaccacatct acctggtggg 2880 

ttctggggag tccatgtaaa gccaggagct tgtgttgcta ggaggggtca tggcatgtgc 2940 

tggcggcacc aaagagagaa acctgagggc aggcaggacc tggtctgagg aggcatggga 3000 

gcccagatgg ggagatggat gtcaggaaag gctgccccat cagggagggt gatagcaatg 3060 
gggggtctgt gggagtgggc acgtgggatt ccctgggctc tgccaagttc cctcccatag 3120 
* tcacaacctg gggacactgc ccatgaaggg gcgcctttgc ccagccagat gctgctggtt 3180 
ctgcccatcc actaccctct ctgctccagc cactctgggt ctttctccag atgccctgga 3240 
caqccctggc ctgggcctgt cccctgagag gtgttgggag aagctgagtc tctggggaca 3300 
ctctcatcag agtctgaaag gcacatcagg aaacatccct ggtctccagg actaggcaat 3360 
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gaggaaaggg ccccagctcc tccctttccc actgagaggg tcgaccctgg gtggccacag 
tgacttctgc gtctgtccca gtcacccrga aaccacaaca aaaccccagc cccagaccct 
gcaggtacaa tacatgtggg gacagtctgt acccagggga agccagttct ctcttcctag 
gagaccgggc ctcagggctg tgcccggogc aggcgggggc agcacgtgcc tgtccttgag 
aactcgggac cttaagggtc tctgctctgt gaggcacagc aaggatcctt ctgtccagag 
atgaaagcag ctcctgcccc tcctctgacc tcttcctcct tcccaaatct caaccaacaa 
ataggtgttt caaatctcat catcaaatct tcatccatcc acatgagaaa gcttaaaacc 
caatggattg acaacatcaa gagttggaac aagtggacat ggagatgtta cttgtggaaa 
tttagatgtg ttcagctatc gggcaggaga atctgtgtca aattccagca tggttcagaa 
%aatcaaaaa gtgtcacagt ccaaatgtgc aacagtgcag gggataaaac tgtggtgcat 
%caaactgag ggatattttg gaacatgaga aaggaaggga ttgctgctgc acagaacatg 
Fgatgatctca cacatagagt tgaaagaaag gagtcaatcg cagaatagaa aatgatcact 
flaattccacct ctataaagtt tccaagagga aaacccaatt ctgctgctag agatcagaat 
ggaggtgacc tgtgccttgc aatggctgrg agggtcacgg gagtgtcact tagtgcaggc 
"Jaatgtgccgt atcttaatct gggcaggcct ttcatgagca cataggaatg cagacattac 
■^ tgctgtgttc attttacttc - accggaaaag aagaataaaa tcagccgggc gcggtggctc 
3 acgcctgtaa tcccagcact ttagaagcct gaggtgggca gattacttga ggtcaggagt 
tcaagaccac cctggccaat atggtgaaac cccggctcta ctaaaaatac aaaaattagc 
tgggcatggt ggtgcgcgcc tgtaatccca gctactcggg aggctgaggc tggacaattg 
cttggaccca ggaagcagag gttgcagtga gccaagattg tgccactgca ctccagcttg 
ggcaacagag ccagactctg taaaaaaaaa aaaaaaaaaa aaaaaaagaa agaaagaaaa 
agaaaagaaa gtataaaatc tctttgggtt aacaaaaaaa gatccacaaa acaaacacca 
gctcttatca aacttacaca actctgccag agaacaggaa acacaaatac tcattaactc 
acttttgtgg caataaaacc ttcatgtcaa aaggagacca ggacacaatg aggaagtaaa 
actgcaggcc ctacttgggt gcagagaggg aaaatccaca aataaaacat taccagaagg 
agctaagatt tactgcattg agttcattcc ccaggtatgc aaggtgattt taacacctga 
aaatcaatca ttgcctttac tacatagaca gattagctag aaaaaaatta caactagcag 
aacagaagca atttggcctt cctaaaatrc cacatcatat catcatgatg gagacagtgc 

agacgccaat gacaataaaa agagggacct ccgtcacccg gtaaacatgt ccacacagct 
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3420 

3480 

3540 

3600 

366D 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 



ccagcaagca-eccgtcttcc cagtgaatca ctgtaacctc ccctttaatc agccccaggc 
aaggctgcct gcgatggcca cacaggctcc aacccgtggg cctcaacctc ccgcagaggc 
tctcctttgg ccaccccatg gggagagcat gaggacaggg cagagccctc tgatgcccac 
acatggcagg agctgacgcc agagccatgg gggctggaga gcagagctgc tggggtcaga 
gcttcctgag gacacccagg cctaagggaa ggcagctccc tggatggggg caaccaggct 
ccgggctcca acctcagagc ccgcatggga ggagccagca ctctaggcct ttcctagggt 
gactctgagg ggaccctgac acgacaggat cgctgaatgc acccgagatg aaggggccac 
cacgggaccc tgctctcgtg gcagatcagg agagagtggg acaccatgcc aggcccccat 
ggcatggctg cgactgaccc aggccactcc cctgcatgca tcagcctcgg taagtcacat 
gibcaagccc aggaccaatg tggaaggaag gaaacagcat cccctttagt gatggaaccc 
^gtcagtg caaagagagg ccatgagcag ttaggaaggg tggtccaacc tacagcacaa 
a&atcatct atcataagta gaagccctgc tccatgaccc ctgcatttaa ataaacgttt 
iljtaaatgag tcaaattccc tcaccatgag agctcacctg tgtgtaggcc catcacacac 
Ipaaacacac acacacacac acacacacac £cacacacac acagggaaag tgcaggatcc 
|j|gacagcac caggcaggct tcacaggcag agcaaacagc gtgaatgacc catgcagtgc 
fectgggcccc atcagctcag agaccctgtg agggctgaga tggggctagg caggggagag 
acttagagag ggtggggcct ccagggaggg ggctgcaggg agctgggtac tgccctccag 
ggagggggct gcagggagct gggtactgcc ctccagggag ggggctgcag ggagctgggt 
actgccctcc agggaggggg ctgcagggag ctgggtactg ccctccaggg agggggctgc 
agggagctgg gtactgccct ccagggaggc aggagcactg ttcccaacag agagcacatc 
ttcctgcagc agctgcacag acacaggagc ccccatgact gccctgggcc agggtgtgga 
ttccaaattt cgtgccccat tgggtgggac ggaggttgec cgtgacatcc aaggggcatc 
tgtgattcca aacttaaact actgtgccta caaaatagga aataacccta ctttttctac 
tatctcaaat tccctaagca caagctagca ccctttaaat caggaagttc agtcactcct 
ggggtcctcc catgccccca gtctgacttg caggtgcaca gggtggctga catctgtcct 
tgctcctcct cttggctcaa ctgccgcccc tcctgggggt gactgatggt caggacaagg 
gatcctagag ctggccccat gattgacagg aaggcaggac ttggcctcca ttctgaagac 
taggggtgtc aagagagctg ggcatcccac agagctgcac aagatgacgc ggacagaggg 



5160 

5220 

5280 

534.& 

5400 

5460 . 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 
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tgacacaggg ctcagggctt cagacgggtc gggaggctca gctgagagtt cagggacaga 6840 

cctgaggagc ctcagtggga aaagaagcac tgaagtggga agttctggaa tgttctggac 6900 

aagcctgagt gctctaagga aatgctccca ccccgatgta gcctgcagca ctggacggtc 6960 

tgtgtacctc cccgctgccc atcctctcac agcccccgcc tctagggaca caactcctgc 7020 

cctaacatgc atctttcctg tctcattcca cacaaaaggg cctctggggt ccctgttctg 7080 

cattgcaagg agtggaggtc acgttcccac agaccaccca gcaacagggt cctatggagg 7140 

tgcggtcagg aggatcacac gtccccccat gcccagggga ctgactctgg gggtgatgga 7200 

ttggcctgga ggccactggt cccctctgtc cctgagggga atctgcaccc tggaggctgc 7260 

cacatccctc ctgattcttt cagctgaggg cccttcttga aatcccaggg aggactcaac 7320 

ccccactggg aaaggcccag tgtggacggt tccacagcag cccagctaag gcccttggac 7380 

acy^gatcctg agtgagagaa cctttaggga cacaggtgca cggccatgtc cccagtgccc 7440 

a^cagagca ggggcatctg gaccctgagt gtgtagctcc cgcgactgaa cccagccctt 7500 

coecaatgac gtgacccctg gggtggctcc aggtctccag tccatgccac caaaatctcc 7560 

agfittgaggg tcctcccttg agtccctgat gcctgtccag gagctgcccc ctgagcaaat 7620 

c^fgagtgca gagggctggg attgtggcag taaaagcagc cacatttgtc tcaggaagga 7680 

allrggaggac atgagctcca ggaagggcga tggcgtcctc tagtgggcgc ctcctgttaa 7740 

t^agcaaaaa ggggccagga gagttgagag atcagggctg gccttggact aaggctcaga 7800 

tiggagaggac tgaggtgcaa agagggggct gaagtagggg agtggtcggg agagatggga 7860 

ggagcaggta aggggaagcc ccagggaggc cgggggaggg tacagcagag ctctccactc 7920 

ctcagcattg acatttgggg tggtcgtgct agtggggttc tgtaagttgt agggtgttca 7980 

gcaccatctg gggactctac ccactaaatg ccagcaggac tccctcccca agctctaaca 8040 

accaacaatg tctccagact ttccaaatgt cccctggaga gcaaaattgc ttctggcaga 8100 
atcactgatc tacgtcagtc tctaaaagtg actcatcagc gaaatccttc acctcttggg 8160 
agaagaatca caagtgtgag aggggtagaa actgcagact tcaaaatctt tccaaaagag 8220 
ttttacttaa tcagcagttt gatgtcccag gagaagatac atttagagtg tttagagttg 8280 
atgccacatg gctgcctgta cctcacagca ggagcagagt gggttttcca agggcctgta 8340 
accacaactg gaatgacact cactgggtta cattacaaag tggaatgtgg ggaattctgt 8400 
agactttggg aagggaaatg tatgacgtaa gcccacagcc taaggcagtg gacagtccac 8460 
tttgaggctc tcaccatcta ggagacatct cagccatgaa catagccaca tctgtcatta 8520 
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gaaaacatgt-tttattaaga ggaaaaatct aggctagaag tgctttatgc tcttttttct 8580 

ctttatgttc aaattcatat acttttagat cattccttaa agaacaatct atccccctaa 8640 

gtaaatgtta tcactgactg gatagtgttg gtgtctcact cccaacccct gtgtggtgac 8700 

agtgccctgc ttccccagcc ctgggccctc tctgattcct gagagctttg ggtgctcctt 87 GO- 

cattaggagg aagagaggaa gggtgttttt a^tattctca ccattcaccc atccaCctct 8820 

.tagacactgg gaagaatcag ttgcccactc ttggatttga tcctccaatt aatgacctct 8880 

atttctgtcc cttgtccatt tcaacaatgt gacaggccta agaggrgcct tctccatgtg 89.40 

atttttgagg agaaggttct caagataagt tttctcacac ctctttgaat tacctccacc 9000 

fgtgtcccca tcaccattac cagcagcatt tggacccttt ttctgttagt cagatgcttt 9060 

C#acctcttg agggtgtata ctgtatgctc tctacacagg aatatgcaga ggaaatagaa 9120 

fgiagggaaat cgcattacta ttcagagaga agaagacctt tatgtcaatg aatgagagtc 9180 

Ifliaaatccta agagagccca tataaaatta ttaccagtgc taaaactaca aaagttacac 9240 

tkacagtaaa ctagaataat aaaacatgca tcacagttgc tggtaaagct aaatcagata 9300 

jSttttttctt agaaaaagca ttccatgtgt gttgcagtga tgacaggagt gcccttcagt 9360 

f^laatatgctg cctgtaattt ttgttccctg gcagaatgta ttgtcttttc tccctttaaa 9420 

||cttaaatgc aaaactaaag gcagctcctg ggccccctcc ccaaagtcag ctgcctgcaa 9480 

ccagccccac gaagagcaga ggcctgagct tccctggtca aaatacgggg ctagggagct 9540 

taaccttgct cgataaagct gtgttcccag aatgtcgctc ctgttcccag gggcaccagc 9600 

ctggagggtg gtgagcctca ctggtggcct gatgcttacc ttgtgccctc acaccagtgg 9660 

tcactggaac cttgaacact tggctgtcgc ccggatctgc agatgtcaag aacttctgga 9720 

agtcaaatta ctgcccactt ctccagggca gatacctgtg aacatccaaa accatgccac 9780 

agaaccctgc ctggggtcta caacacatat ggactgtgag caccaagtcc agccctgaat 9840 

ctgtgaccac ctgccaagat gcccctaact gggatccacc aatcactgca catggcaggc 9900 

agcgaggctt ggaggtgctt cgccacaagg cagccccaat ttgctgggag tttcttggca 9960 

cctggtagtg gtgaggagcc ttgggaccct caggattact ccccttaagc atagtgggga 10020 

cccttctgca tccccagcag gtgccccgct cttcagagcc tctctctctg aggtttaccc 10080 

agacccctgc accaatgaga ccatgctgaa gcctcagaga gagagatgga gctttgacca 10140 

ggagccgctc ttccttgagg gccagggcag ggaaagcagg aggcaccacc aggagtggga 10200 
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acaccagtgt ctaagcccct gatgagaaca gggtggtctc tcccatatgc ccataccagg 10260 

cctgtgaaca gaatcctcct tctgcagtga caatgtctga gaggacgaca tgtttcccag 10320 

cctaacgtgc agccatgccc atctacccac tgcctactgc aggacagcac caacccagga 10380 

gctgggaagc tgggagaaga catggaatac ccatggcttc tcaccttcct ccagtccagt 10440 

gggcaccatt tatgcctagg acacccacct gccggcccca ggctcttaag agttaggtca 10500 

cctaggtgcc tctgggaggc cgaggcagga gaattgcttg aacccgggag gcagaggttg 10560 

cagtgagccg agatcacacc actgcactcc agcctgggtg acagaatgag actctgtctc 10620 

aaaaaaaaag agaaagatag catcagtggc taccaagggc taggggcagg ggaaggtgga 10680 

gagttaatga ttaatagtat gaagtttcta tgtgagatga tgaaaatgtt ctggaaaaaa 10740 

.i^atatagtg gtgaggatgt agaatattgt gaatataatt aacggcattt aattgtacac 10800 

Jttaacatgat taatgtggca tattttatct tatgtatttg actacatcca agaaacactg 10860 

;^^5agagggaa agcccaccat gtaaaataca cccaccctaa tcagatagtc ctcattgtac 10920 

iocaggtacag gcccctcatg acctgcacag gaataactaa ggatttaagg acatgaggct 10980 

stcccagccaa ctgcaggtgc acaacataaa tgtatctgca aacagactga gagtaaagct 11040 

|J[g9ggcacaa acctcagcac tgccaggaca cacacccttc tcgtggattc tgactttatc 11100 

Jitgacccggcc cactgtccag atcttgttgt gggattggga caagggaggt cataaagcct 11160 

•=i^tccccaggg cactctgtgt gagcacacga gacctcccca cccccccacc gttaggtctc 11220 

cacacataga tctgaccatt aggcattgtg aggaggactc tagcgcgggc tcagggatca 11280 

caccagagaa tcaggtacag agaggaagac ggggctcgag gagctgatgg atgacacaga 11340 

gcagggttcc tgcagtccac aggtccagct caccctggtg taggtgcccc atccccctga 11400 

tccaggcatc cctgacacag ctccctcccg gagcctcctc ccaggtgaca catcagggtc 11460 

cctcactcaa gctgtccaga gagggcagca ccttggacag cgcccacccc acttcactct 11520 

tcctccctca cagggctcag ggctcagggc tcaagtctca gaacaaatgg cagaggccag 11580 

tgagcccaga gatggtgaca gggcaatgat ccaggggcag ctgcctgaaa cgggagcagg 11640 

tgaagccaca gatgggagaa gatggttcag gaagaaaaat ccaggaatgg gcaggagagg 11700 

agaggaggac acaggctctg tggggctgca gcccaggatg ggactaagtg tgaagacatc 11760 

tcagcaggtg aggccaggtc ccatgaacag agaagcagct cccacctccc ctgatgcacg 11820 

gacacacaga gtgtgtggtg ctgtgccccc agagtcgggc tctcctgttc tggtccccag 11880 

ggagtgagaa gtgaggttga cttgtccctg ctcctctctg ctaccccaac attcaccttc 11940 
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tcctcatgcc-cctctctctc aaatatgatt C^gatctatg tccccgccca aatctcatgt 
caaattgtaa accccaatgt tggaggtggg gccttgtgag aagtgattgg ataatgcggg 
tggattttct gctttgatgc tgtttctgtg atagagatct cacatgatct ggttgtttaa 
aagtgtgtag cacctctccc ctctctctct ctctctctta ctcatgctct gccatgtaag 
acgttcctgt ttccccttca ccgtccagaa tgattgtaag ttttctgagg cctccccagg 
agcagaagcc actatgcttc ctgtacaact gcagaatgat gagcgaatta aacctctttt 
ctttataaat tacccagtct caggtatttc tttatagcaa tgcgaggaca gactaataca 
atcttctact cccagatccc cgcacacgct tagccccaga catcactgcc cctgggagca 
tgcacagcgc agcctcctgc cgacaaaagc aaagtcacaa aaggtgacaa aaatctgcat 
t;t|ggggacat ctgattgtga aagagggagg acagtacact tgtagccaca gagactgggg 
<|icaccgagc tgaaacctgg tagcactttg gcataacatg tgcatgaccc gtgttcaatg 
^ctagagatc agtgttga^t aaaacagcct ggtctggggc cgctgctgtc cccacttccc 
i^ctgtccac cagagggcgg cagagttcct cccaccctgg agcctcccca ggggctgctg 
acctccctca gccgggccca cagcccagca gggtccaccc tcacccgggt cacctcggcc 
t#icgtcctcc tcgccctccg agctcctcac acggactctg tcagctcctc cctgcagcct 
atcggccgcc cacctgaggc ttgtcggccg cccacttgag gcctgtcggc tgccctctgc 
^ggcagctcc tgtcccctac accccctcct tccccgggct cagctgaaag ggcgtctccc 
agggcagctc cctgtgatct ccaggacagc tcagtctctc acaggctccg acgcccccta 
tgctgtcacc tcacagccct gtcattacca ttaactcctc agtcccatga agttcactga 
gcgcctgtct cccggttaca ggaaaactct gtgacaggga ccacgtctgt cctgctctct 
gtggaatccc agggcccagc ccagtgcctg acacggaaca gatgctccat aaatactggt 
taaatgtgtg ggagatctct aaaaagaagc atatcacctc cgtgtggccc ccagcagtca 
gagtctgttc catgtggaca caggggcact ggcaccagca tgggaggagg ccagcaagtg 
cccgcggctg ccccaggaat gaggcctcaa cccccagagc ttcagaaggg aggacagagg 
cctgcaggga atagatcctc cggcctgacc ctgcagccta atccagagtt cagggtcagc 
tcacaccacg tcgaccctgg tcagcatccc tagggcagtt ccagacaagg ccggaggtct 
cctcttgccc tccagggggt gacattgcac acagacatca ctcaggaaac ggattcccct 
ggacacgaac ctggctttgc taaggaag1:g gaggtggagc ctggtttcca tcccttgctc 



12OD0 
12060 
12120 
12100 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
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caacagaccc ttctgatctc tcccacatac ctgctctgtt cctttctcgg tcctatgagg 13680 

accctgttct gccaggggtc cctgtgcaac tccagactcc ctcctggtac caccatgggg 13740 

aaggtggggt gatcacagga cagtcagcct cgcagagaca gagaccaccc aggactgtca 13800 

gggagaacat ggacaggccc tgagccgcag ctcagccaac agacacggag agggagggtc 13860 

cccctggagc cttccccaag gacagcagag cccagagtca cccacctccc tccaccacag 1392Q 

tcctctcttt ccaggacaca caagacacct ccccctccac atgcaggatc tggggactcc 13980 

tgagacctct gggcctgggt ctccatccct gggtcagtgg cggggttggt ggtactggag 14040 

acagagggct ggtccctccc cagccaccac ccagtgagcc tttttctagc ccccagagcc 14100 

acctctgtca ccttcctgtt gggcatcatc ccaccttccc agagccctgg agagcatggg 14160 

^Sgacccggg accctgctgg gtttctctgt cacaaaggaa aataatcccc ctggtgtgac 14220 

4gacccaagg acagaacaca gcagaggtca gcactgggga agacaggttg tcctcccagg 14280 

^§atgggggt ccatccacct tgccgaaaag atttgtctga ggaactgaaa atagaaggga 14340 

likaaagalgga gggacaaaag aggcagaaat gagaggggag cggacagacg acacctgaat 14 400 

aiaaacaccaca. cccatgaccc acgtgatgct gagaagtact cctgccctag gaagagactc 14460 

kl transcription start site -i^r-^^ 

jagggcagagg gaggaaggac agcagaccag acagtcacag cagccttgac aaaacgttcc 14520 



-Cggaactcaa gctcttctcc acagaggagg acagagcaga cagcagagac catgoagtct 14580 

J^^CQtcggccc ctccccacag atggtgcatc ccctggcaga ggctcctgct cacacgtgaa 14640 

gggaggacaa cctgggagag ggtgggagga gggagctggg gtctcctgcg taggacaggg 14700 

ctgtgagacg gacagagggc tcctgttgga gcctgaatag cgaagaggac atcagagagg 14760 

gacaggagtc acaccagaaa aatcaaattg aactggaatt gcaaagggcc aggaaaacct 14820 

caacagttct attttcctag ttaattgtca ctggccacta cgtttttaea aatcetaata 14880 

actccatcag atgacacttt aaataaaaac ataaccaggg catgaaacec tgtcctcatc 14 940 

cgcctaccgc ggacattgga aaataagccc caggctgtgg agggccctgg gaaccctcat 15000 

gaactcatcc acaggaatct gcagcctgtc ccaggcactg gggtgcaacc aagatc 15056 
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Mucin-TRE SEQ ID NO: 



cgagcggccc ctcagcttcg gcgcccagcc 

gggaggagct cctggccagt ggtggagagt 
120 

cccaggttta ctcccttaag tggaaatttc 
180 

9^ggga^ccc aggctgctgg aaagtccggc 
240 

cggggtgtgg aacgggacag ggagcggtta 
300 



ggggagggag cccaaaacta gcacctagtc 
360 

ccgctctgct tcagtggacc cggggagggc 
420 

gcttcccgac cttgctgtac aggacctcga 
480 

tagttgttgc cctgaggcta aaactagagc 
540 

ccccctcccc cggagccagg gagtggttgg 
600 

gggtagtcag ggggttgagc gattagagcc 
660 



gaggaggaag aggtaggagg taggggaggg 
720 



ctgtgcctag ggcgggcggg cggggagtgg 
780 



gcccgctcca cctctcaagc agccagcgcc 
840 

catttcacca ccaccatg 
858 



ccgcaaggct cccggtgacc actagagggc 60 
ggcaaggaag gaccctaggg ttcatcggag 

ttcccccact cctccttggc tttctccaag 

tggggcgggg actgtgggtt caggggagaa 

gaagggtggg gctattccgg gaagtggtgg 

cactcattat ccagccctct tatttctcgg 

ggggaagtgg agtgggagac ctaggggtgg 

cctagctggc tttgttcccc atccccacgt 

ccaggggccc caagttccag actgcccctc 

tgaaaggggg aggccagctg gagaacaaac 

cttgtaccct acccaggaat ggttggggag 

ggcggggttt tgtcacctgt cacctgctcg 

ggggaccggt ataaagcggt aggcgcctgt 

tgcctgaatc tgttctgccc cctccccacc 



aFP-TRE SEQIDNO:_ 

gaattcttag aaatatgggg gtaggggtgg 

gagataagca ttgggttaaa tgtgctttca 
12 0 

aacagactat gggctggagg actttgagga 
180 



tggtggtaat tctgttttca ccccataggt 60 
cacacacatc acatttcata agaattaagg 

tgtctgtctc ataacacttg ggttgtatct 
203 



gttctatggg gcttgtttta 
240 

aagcccaagg tactgtcctc 
300 

gagaaaatat aaacatttca 
360 

tcatttgtat caatgaatga 
420 

gagggtaggg gtggtactat 
480 

tgcttgtgac aaatgttttg 
540 

aagaagttaa tggtccagga 
600 

acaagaggtt ccatgagaaa 
660 

ctcttctcta gctagagact 
720 

taagggttgg gtacattttg 
780 

attttcaact aaccaaccac 
840 

gtgagataag agaaggaatt 
900 

gtatatatct aatatttaac 
960 

tttaatgtct ataagtacca 
1020 

acttgagggg atagatcatt 
1080 

ttgcctgtca tacagctaat 

1140 

ctttctaata ccaaagttca 
1200 

tatgaaaaag taaacggaat 
1260 

acttaggact aaactttgag 
1320 

acaaactctg gttcaaagct 
1380 



agcttggcaa cttgcaacag 

ttttcatatc tgttttgggg 

ataatgttct gtggtgagat 

atgaggacaa ttagtgtata 

tcaatttcta tttataaaga 

ttcgggacca caggaatcac 

ataattacat agcttacaaa 

ataatctgaa aggtttaata 

aatcagaaat acattcaggg 

ttcaagcatt gatggagaag 

ccaatccaac aaacaaaaaa 

ttctcacaac ccacacgtat 

actaacatca tgctaataat 

ggcatttaga agatattatt 

ttcatgatat atgagaaaaa 

aattgaccat aagacaatta 

gtttactgtt ccatgttgct 

cagaattaca tcaatgcaaa 

caataacaca catagattga 

cctctttatt gcttgtcttg 
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ggttcactga ctttctcccc 
cctctggggc ttgaatatct 
gagtatgaga gatgtgtcat 
aatccttagt acaacaatct 
tacttatttc tatttattta 
aaagatgagt ctttgaattt 
tgactatgat ataccatcaa 
agttgtcaaa ggtgagaggg 
ataattattt gaatagacct 
gagagtgaat atttgaaaac 
tgaaaagaat ctcagaaaca 
agctcaactg ctctgaagaa 
gataataatt actgtcattt 
ccatttatat atcaaaataa 
ttaaaaacag attgaattat 
gatttaaatt agttttgaat 
tctgagtggc ttcacagact 
agcattgctg tgaactctgt 
ggattgtttg ctgttagcat 
gaaaatttgc tgttcttcat 



ggtttctctt ttcactgcta 
1440 

tgcaagctta tgattcccaa 
1500 

agtagtattc aaatgcacat 
1560 

caaaccgggg agttttgcct 
1620 

tcttgttcct taaaatctaa 
1680 

agagtgaaaa gggagcctga 

1740 

gtttgggtaa actggtcact 
1800 

agttactaag tctttgactt 
1860 

ttgacagtat tattgcgaaa 
1920 

tgaaacaaaa gagacagttt 
1980 

ctataatagt cccagaagga 
2040 

tgccaacttt gccaggaatt 
2100 

ttcaaaactg cattttctct 
2160 

tttccaccac cttttacatt 
2220 

tctaattttc tttcagatct 
2280 



cctccaaaga tgtcatgagt 
2340 



tgttgtgatt actgatacta 
2400 

tacatggtgc ctgtctcttg 
2460 



gacttaacat tttattgaat 

2520 



agacccggtt ctgaggcatt 
2580 



tctatttttc tcaaccactc 

atatctatct ctagcctcaa 

caacgtctcc acttggaggg 

ggaatgtttc ctaaaatgtg 

ttacttttag cccagtgctc 

ttaataatta cactaagtca 

ttatcttaaa ctaaatatat 

tatctcattc ataccactca 

acttcctaac tggtctcctt 

caaaatacaa atatgatttt 

gttataaact ccatttaaaa 

cccaatatct agtattttct 

cattccctaa gtgtgcattg 

ttcctggaac actataccct 

ccatgaagat gttacttcct 

tcctcttttc attctactaa 

ttgtctgttt ctctgattag 

ttgctgatta ttcccatcca 

gaataaataa aaccccatct 

tatatttatt gatttattta 
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acatggctac aataactgtc 
tcttgttcca gaagataaaa 
cttaaagacg tttcaacata 
tcctgtagca catagggtcc 
atcccaccta tggggagatg 
ataggcatag agccaggact 
ccaaaactga acatgtactt 
gctttatcca ggccacttat 
atcatagtct tatccccttt 
tattagctcc cttttgttgt 
agtctttgag atgtggccct 
actattaaac tttgtgcctc 
ttttccctta ccggttggtt 
ccctcttcat ttggcccacc 
ccaggaagcc ttatctgacc 
tcacagcatc catcacacca 
gcagtaagct caacaagagc 
aaaacagtgc ctggaatgca 
atcgagtgct actttgtgca 
attctcattt aaccatgaag 



gaggtactat cactatcctt attttatagt tgataaagat aaagcccaga gaaatgaatt 
2640 



aactcaccca aagtcatgta gctaagtgac agggcaaaaa ttcaaaccag ttccccaact 
2700 



ttacgtgatt aatactgtgc tatactgcct ctctgatcat atggcatgga atgcagacat 
2760 

ctgctccgta aggcagaata tggaaggaga ttggaggatg acacaaaacc agcataatat 
2820 

cagaggaaaa gtccaaacag gacctgaact gatagaaaag ttgttactcc tggtgtagtc 
2880 

gcatcgacat cttgatgaac tggtggctga cacaacatac attggcttga tgtgtacata 
2940 

ttatttgtag ttgtgtgtgt atttttatat atatatttgt aatattgaaa tagtcataat 
3000 

ttactaaagg cctaccattt gccaggcatt tttacatttg tcccctctaa tcttttgatg 
3060 

agatgatcag attggattac ttggccttga agatgatata tctacatcta tatctatatc 
3120 



tatatctata tctatatcta tatctatatc tatatctata tatgtatatc agaaaagctg 
3180 



aaatatgttt tgtaaagtta taaagatttc agactttata gaatctggga tttgccaaat 
3240 



gtaacccctt tctctacatt aaacccatgt tggaacaaat acatttatta ttcattcatc 
3300 



aaatgttgct gagtcctggc tatgaaccag acactgtgaa agcctttggg atattttgcc 
3360 



catgcttggg caagcttata tagtttgctt cataaaactc tatttcagtt cttcataact 
3420 



aatacttcat gactattgct tttcaggtat tccttcataa caaatacttt ggctttcata 
3480 



tatttgagta aagtccccct tgaggaagag tagaagaact gcactttgta aatactatcc 
3540 



tggaatccaa acggatagac aaggatggtg ctacctcttt ctggagagta cgtgagcaag 
3600 



gcctgttttg ttaacatgtt ccttaggaga caaaacttag gagagacacg catagcagaa 
3660 



aatggacaaa aactaacaaa tgaatgggaa ttgtacttga ttagcattga agaccttgtt 
3720 



tatactatga taaatgtttg tatttgctgg aagtgctact gacggtaaac cctttttgtt 
3780 
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taaatgtgtg ccctagtagc 
3840 

tttaaaaatt ttatgtttaa 
3900 

agatagaatt aaattcactt 
3960 

tccattattt ctgtctttta 
4020 

gtacaatgat acacaaatga 
4080 

acaatatatt atgagcatat 
4140 

ggctgtatat caccctactt 
4200 

tttgatttta taatcttacc 
4260 

ggtgcctggg tctcaactcc 
4320 

ttgggaattc aaaaagttcc 
4380 

tagacaggga tgataggagg 
4440 

catcttgagt tgctaagtga 
4500 

taaaattatt ataggacttg 
4560 

atacatcgca tactaaatac 
4620 

tgtttctgag ctaaacaatg 
4680 

attattctgc aacttaggga 
4740 

gttatatttg caagatgaag 
4800 

tgataggcat ttaatagttt 
4860 

gctgtctttt ctttatggct 
4920 

tagggacaag tcatgtcttt 
4980 



ttgcagtatg atctattttt 
aattgcatag tgctctttca 
atcttaccat ctagagaaac 
ttcaacattt tttttagagg 
gagcactctc catgtattgt 
ttccatttca ttaaatattc 
tatgaatgta ccatattaat 
tttagaataa tgaaacacct 
acagattctg atttaactgg 
cccagtgatt ctaatgtgta 
tgagccactc ttagcatcca 
atgatgcacc tgacccactt 
gtttattagg gcttgtgctc 
tttaaaatgt accttattga 
acagcataat tatcaagcaa 
caagtcatct ctctgaattt 
agtctgaatt ggtcagacaa 
taaagaatta atgtatttag 
tcattaactt aatttgagag 
gaatattctg tagtttgagg 
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taagtactgt acttagctta 
ttgaagaagt tttgagagag 
ccaatgttaa aactttgttg 
gtgggaggaa tacagaggag 
tttgtcctgt ttttcagtta 
ttccacaaag ttattttgat 
ttatttcctg gtgtgggtta 
gtgaagcttt agaaaatact 
tctgggttac agactaggca 
gccaagatcg ggaacccttg 
tcatttagta ttaacatcat 
tataaagaca catgtgcaaa 
taagttttct atgttaagcc 
catacatatt aagtgaaaag 
tgataatttg aaatgaattt 
tttgtacttt gagagtattt 
tgtcttgtgt gcctggcata 
atgaattgca taccaaatct 
aaattaatta ttctgcaact 
agaatatttg ttatatttgc 



aaaataaaat aagtttgcaa gttttttttt 
5040 

ataaaataca aataaccgct atgctgttaa 
5100 

ggaaatacca taaagtaaca gatataccaa 
5160 

gaaaagagta taaaagaatt tcagcatgat 
5220 

aaca 
5224 



tctgccccaa agagctctgt gtccttgaac 
ttattggcaa atgtcccatt ttcaacctaa 
caaaaggtta ctagttaaca ggcattgcct 
tttccatatt gtgcttccac cactgccaat 
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